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ABSTRACT 

Self-calibration techniques for analyzing galaxy cluster counts utilize the abundance and the cluster- 
ing amplitude of dark matter halos. These properties simultaneously constrain cosmological parame- 
ters and the cluster observable-mass relation. It was recently discovered that the clustering amplitude 
of halos depends not only on the halo mass, but also on various secondary variables, such as the halo 
formation time and the concentration; these dependences are collectively termed "assembly bias." 
Applying modified Fisher matrix formalism, we explore whether these secondary variables have a sig- 
nificant impact on the study of dark energy properties using the self-calibration technique in current 
(SDSS) and the near future (DES, SPT, and LSST) cluster surveys. The impact of the secondary 
dependence is determined by (1) the scatter in the observable-mass relation and (2) the correlation 
between observable and secondary variables. We find that for optical surveys, the secondary depen- 
dence does not significantly influence an SDSS-like survey; however, it may affect a DES-like survey 
(given the high scatter currently expected from optical clusters) and an LSST-like survey (even for 
low scatter values and low correlations). For an SZ survey such as SPT, the impact of secondary 
dependence is insignificant if the scatter is 20% or lower but can be enhanced by the potential high 
scatter values introduced by a highly -correlated background. Accurate modeling of the assembly bias 
is necessary for cluster self-calibration in the era of precision cosmology. 

Subject headings: cosmology: theory — cosmological parameters — large-scale structure of universe 
— galaxies: clusters: general — galaxies: halos — methods: statistical 



1. INTRODUCTION 

The observed accelerating expansion of the Universe, 
which is often interpreted as evidence of dark energy, is 
one of the most surprising results of modern cosmology. 
In the ACDM paradigm, dark energy governs the late 
time expansion of the Universe, halting the growth of 
structures. Consequently, the evolution of the number of 
massive galaxy clusters provi des one of the most power- 
ful probes of d ark energy fe .g. IWang fc Steinhardtj|1998l: 
Haiman et all 2001: Holde r et all 120011: iLevine et al ' 



^0^ 'Hu '2003; Maiumdar & Mohr '2003; ' Rozo et al 
2007: Gladdcrs et al. 2007 : Mantz ct al. 2007.) . 

Several planned and ongoing surveys will 
identify massive clusters over substantial vol- 
umes using a variety of tec hniques, i nclud- 
ing o ptical gala xy counts (e.g. lY"ork et al.l 120001: 



The Dark Energv Survev Collaboration! |2005|: | Tvson 



20021) , the Suny a ev-Ze l'dovich effect (e.g. iMiLeLal 



2004!; 'Kosowskv* '200 3D, and X-ray emissions (e.g. 
Ebchng et al. 2007; Bu renin et ani2007[ ). These cluster 
surveys will complement a variety of future dark energy 
measurements using tools such as Type la supernovae, 
weak lensing, and baryon acoustic oscillations. Since 
each of these methods is subject to different systematics, 
combining them thus provides cross checks necessary to 
avoid biased inferenc es on the properties of dark energy 
(|Albrecht et al.ll2006l ). 
While the abundance of clusters as a function of mass is 
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well understood from a theoretical standpoint, measuring 
this abundance relies on observable mass tracers. This 
reliance is the single most significant obstacle confronting 
the use of clusters as cosmological probes. In particu- 
lar, the statistical observable-mass relation needs to be 
understood to high accuracy in order to avoid system- 
atic errors in the inference of cosmological parameters. 
Alternatively, additional observable quantities that de- 
pend on halo mass allow one to simultaneously constrain 
cosmology and the aforementioned observable-mass re- 
lation. One such observable quantity is the clustering 
amplitude of clusters, which depends sensitively on mass 
and can be determined through a counts-in-cells anal- 
ysis. This g eneral method is often referred to as "self- 
calibration" (jMaiumdar fc Mohj|2004l:lLima fc Hull2004 
[20051 [2OOI . 

In this work, we explore a possible systematic that 
arises in the self-calibration analysis, namely, the de- 
pendence of the clustering amplitude of halos on sec- 
ondary variables. The clustering amplitude of halos is 
characterized by the halo bias, and recent studies have 
shown that halo bias depends not only on halo mass 
but also on additional halo properties, such as concen- 
trati on, for mation time, spin , substructu re fractio n, etc 
e.g. iGao e t al. 2005; Harkc r et all 120061: IWcchslcr et al 
hite,2007^ .Wetzel et al.ll2007; .Bett et al 



200 



20071 : 



Gao fc W: 



Jing et al 



l2007f ). These dependences are often in- 



terpreted as arising from the different assembly histories 
of halos of the same mass, and we refer to these depen- 
dence s collectively as "assembly bias" (e.g. ICroton et al.l 
|2007( ). If cluster selection is biased with respect to any 
of these variables, the observed clustering amplitude of 
clusters will deviate from the mean clustering amplitude 
of clusters with the same mass distribution. This devi- 
ation will lead to a biased inference of the observable- 
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mass relation, and therefore to biased estimates for the 
cosmological parameters of interest. 

We herein take the secondary parameter to be the hafo 
concentration, which has been shown to play a role in 
halo bia s for massive clusters by Wechsler e t al. (2006; 
see also IWetzel et alJl2007t iJing et al.ll200l . We then 
incorporate the concentration dependence of halo bias 
into the standard s elf-calibration formalism developed in 
iLima k Hul (|2004l . [2005) . With modified Fisher matrix 
formalism, we investigate the impact of this additional 
dependence on cosmological parameter estimates from 
self-calibration. We specifically calculate the expected 
effects for four example galaxy cluster surveys, which 
represent the Sloan Digital Sky Survey (SDSS; assuming 
clusters selected from the photometric data), the Dark 
Energy Survey (DES), the South Pole Telescope (SPT), 
and the Large Synoptic Survey Telescope (LSST). We 
also explore various assumptions about the correlation 
between cluster observable and concentration. In detail, 
the significance of this systematic effect depends on the 
strength of this correlation as well as on the observable- 
mass scatter. We find that the resulting bias in the 
inferred cosmological parameters is insignificant for the 
current SDSS photometric surveys, but it can be signifi- 
cant for upcoming photometric surveys such as DES and 
LSST. On the other hand, for SZ this systematic is less 
likely to be significant if the scatter is small and mainly 
intrinsic, but may still be significant if the correlated 
background dominates the scatter. 

This paper is organized as follows. In ij2]we discuss why 
assembly bias may lead to biased cosmological parame- 
ter estimates in cluster counting experiments. In i)3.1l 
we review the standard self-calibration formalism, and 
then proceed in §3.21 to include assembly bias into this 
formalism. Our statistical methodology for estimating 
the systematic errors due to assembly bias is described 
Details of our implementation can be found in 



fjH Section [5] presents our results and discussion. We 
summarize in ^ 

2. HALO BIAS AND DARK ENERGY: WHY ASSEMBLY 
BIAS MATTERS 

Halo bias characterizes the clustering amplitude of 
dark matter halos, and it is typically defined as the ra- 
tio between the density contrast of halos and that of 
the dark matter. In the hierarchical structure forma- 
tion predicted by CDM, halo bias is a strong function 
of mass, increasing for more massive halos. This depen- 
dence on mass is now well calibrated from numerical sim- 
ulations and can be approximated analytically wi th the 
exc ursion -set theory (e.g. Mo & White 1996; Shct h et al.l 
[20011: ISeliak fc Warreni 120041 : lZentneiil2007D . Halo bias 
depends sensitively on dark energy in a way that is com- 
plementary to the dependence of the mass function on 
dark energy; thus, including the halo bias information in 
cluster counting experiments improves the dark energy 
constraints from using mean halo abundances alone. 

Much work on halo bias has made the simplify- 
ing assumption that halo bias depends only on halo 
mass. However, recent studies based on A^-body sim- 
ulations have found evidence that secondary variables 
such as halo forma tion time and concentration do im- 
pact halo bias (e.g. iGao et al.l [20051: iHarker et al.ll2006t 
iWechsler et all 120061: iGao fc Whitd [20071 IWetzel eTal] 



I2007t iBett et al.l [20071 iJing et all [2001 . In this work, 
we focus on the impact of halo concentration on halo 
bias, principally because among all secondary parame- 
ters, this dependence is the strongest at cluster scales 
and is the best understood statistically. The halo concen- 
tration describes the halo density profile and is defined 
as c = i?vii /?'s, where rg is the radius where the density 
profile has a log slope of —2. The halo concentration has 
been shown t o correlate tightly wit h the halo formation 
epoch by e.g. IWechsler etakl (|2002f) . 

We specifica l ly us e the fitting formula given by 
IWechsler et all (|2006L eq. 6): 

b^'^iM, c) = 6avg(M) X b,{c\M/M,) (1) 

where 6avg(-^) is the mean halo bias at fixed mass, 
6c(c|M/M*) characterizes the concentration dependence 
of halo bias, and is the characteristic mass of gravita- 
tional collapse [quantitatively defined as a{M^) = 1.686, 
where a{M) is the r.m.s density fluctuation inside a 
sphere that encloses mass M]. The superscript "ab" 
refers to "assembly bias," which we use as a generic 
term for the dependence of halo bias on secondary vari- 
ables, based on the conjecture that these dependences 
arise through the different formation histories of halos 
of the same mass. We assume this formula holds for all 
clusters included in our fiducial surveys, although part of 
these clusters are outside the range where this formula 
has been calibrated with simul ations. In addition, we 
note that IWechsler et al.l (|2006f ) calibrated this formula 
with Mvir, while the mass function and the halo bias we 
use are not always well-calibrated with the same mass 
definition. We ignore the systematic errors that may be 
caused by these uncertainties. 

The left panel of Figure [T] illustrates ho w concentration 
impac ts halo bias in the fitting formula of I Wechsler et al.) 
(j2006f) . As can be seen, for M > W^^-^H-'^Mq, low con- 
centration halos are more clustered than high concentra- 
tion ones of the same mass. This difference is potentially 
significant: if the cluster observable is correlated with 
concentration, one might measure cluster bias that dif- 
fers from the mean halo bias for random halos of the 
same mass. 

The right panel of Figure [T] shows how the effect of 
assembly bias can resemble that of a high dark energy 
density, with an extreme assumption of perfectly anti- 
correlated observable and concentration. Cumulative 
bias, which is relevant for halo samples above a certain 
observable threshold (see eq. [iT]), is plotted here. As 
can be seen, if we tend to observe low concentration ha- 
los, the effect of assembly bias (dashed curve) makes the 
observed halo bias higher than the mean halo bias (aver- 
aged over random halos samples of the same mass) for the 
same fiducial cosmology (solid curve). This effect mim- 
ics a high dark energy density ^Ide = 0.9 (dotted curve), 
since high f^DE will make structures rarer and more clus- 
tered. Thus, a wrong inference of r^DE is possible if as- 
sembly bias is ignored in this case. In the following sec- 
tions, we provide detailed formalism and analyses of such 
systematics under the framework of the self-calibration 
of observable-mass relation. 

3. FORMALISM 

3.1. Counts-in-Cells Analysis and Basic 
Self-Calibration: A Review 
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Fig. 1. — Left : Dep endence of halo bias on concentration at z = assumed in this work, based on the fitting formula of b^^ (M, c) 
in lWechsler et aLl 120061 ). We assume a WMAP3 cosmology and log- normally distributed concentrations at a given halo mass. Halos are 
binned by concentration into four quartiles, and the halo bias of each quartile systematically deviates from the average halo bias {solid 
curve). Above 10^'^'^h~^MQ, low concentration halos {red and orange dotted curves) are more clustered than high concentration ones 
{green and blue dashed curves) of the same mass. The bottom panel shows the residual compared with the average halo bias. Right panel: 
Degeneracy between high dark energy density and assembly bias. The solid curve shows the cumulative bias (eq. 1111 . with the selection 
function nonzero above a threshold Mtj,) for the fiducial WMAP3 cosmology. The dashed curve shows the effect of assembly bias with the 
assumption of perfectly anti-correlated cluster observable and concentration (see ^ and 13.21 for details) . This correlation can mimic the 
effect of high dark energy density (here assumed to be Hde = 0.9), shown as the dotted curve. 



In a pixelated galaxy cluster survey, halo bias is related 
to the sample variance of clu ster counts within the small 
sub-volumes of the survey (|Hu fc Kravtsovl [20031) ■ In- 
cluding the sample variance in a counts-in-cells analysis 
allows one to "self-calibrate" the observable-mass distri- 
bution, which is one of the main uncertainties in mod- 
eling the surveys. This approach can thereby improve 
the dark en ergy constraints relative t o "counts only" ex- 
penmeife (iMajumdar fc Mohil 120041 : iLima fc Hul 12004 
l2005l l2007f ). In this section, we review the basic self- 
calibration, closely following the formalism developed by 
[Lima & Hu (2004). 

Given a large-volume survey, consider a redshift slice 
which is sufhciently thin to make evolution ignorable. 
We then divide the area of this slice into equal-area cells 
and count the clusters in each cell.'^ The number of clus- 
ters in cell i, denoted by Ni, is affected by the Poisson 
shot noise, which is modeled as Ni ^ Poisson(mi). This 
Poisson mean rrii varies from cell to cell due to the large- 
scale clustering of matter and halos, and this fluctuation 
can be modeled as a normal distribution ~ N(to, S), 
where ffi is the mean halo abundance and S is the sample 
variance. 

In a given mass range, the mean number counts of 
clusters in cell i depend on m, the bias integrated over 
the mass range 6, and the mass overdensity Si within this 

Here we suppress all redshift dependence in our notation for 
simplicity. In practice, we consider the redshift dependence of the 
mass function, the halo bias, the observable— m ass distribution, an d 
the comoving survey volume. For readers of ILima &: Hul II2004I ) , 
note that our notation is slightly different. Since we consider a sin- 
gle redshift slice, our subscript i indicates the cell label of the same 
redshift, while in ILima fc Hul II2004I) . their subscript i indicates a 
cell of redshift Zi. 



cell with respect to the background: 

im = m(l + b 6i) . (2) 
The sample variance then has the form 
S={{mi - frif) 
= ffiWc7^ , (3) 

where 

4 = j^,w{k)w*{k)p{k) . (4) 

Here P{k) is the matter power spectrum and W{k) is the 
fc-space window function of a cell of volume y, normal- 
ized such that V = J (fxW{x). Applying a counts-in- 
cells analysis, Ni of each cell can be measured, and m 
and S can be obtained from a likelihood analysis. With 
additional knowledge of the matter power spectrum, b 
can be obtained. 

Note that this sample variance should be more rigor- 
ously defined as the sample covariance 

Stj = {{m^ - fh){mj ~ m)) (5) 

- 2T.2 2 

with 

1 r H^k 

4-^ J j^w.ik)w;ik)Pik) . (6) 

In practice, our cell size is much larger than the corre- 
lation length of clusters; thus, the correlations between 
different cells are negligible. The off-diagonal elements 
are therefore much smaller then the diagonal ones, and 
the matrix Sij reduces to a diagonal matrix Sij — 5ijS, 
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whose dimension equals ric, the number of ceUs in the 
redshift shoe. 

We next relate these measurable quantities to theoret- 
ical models. Let Mobs denote the observed mass proxy 
(the observable) of galaxy clusters. Given a differential 
mass function dn/dM and an observable-mass distribu- 
tion P(Afobs|M), the differential observed cluster abun- 
dance is given as 

'^^ dM ^PiAU^lM) . (7) 



dMobs J dM 
In terms of the binning function 0(Mobs) — which is de- 
fined to be equal to unity if Aiobs falls in the bin corre- 
sponding to the observable range, and zero otherwise — 
and the cell volume V, the mean observed cluster abun- 
dance reads 

dMobs ^jr^HMohs) , (8) 

which can be further simplified as 

dn 



m 



V 



dM ^im) 



if we define the selection function to be 

(0|Af) = /dAfobs P(Afobs|M)(/.(A/obs) 



(9) 



(10) 



Given the halo bias b{M), the bias integrated over the 
observable bin similarly reads 



[ dM ^b{M)mi) 
m / dM 



(11) 



From equations [9l and [TT] we can see that if both ffi and b 
are measured in the survey, the selection function {4>\M) 
can be self-calibrated. 

In large- volume surveys, we often have several redshift 
bins and need to consider how rh and S vary with red- 
shift: fh{z) and S{z). The sample variance is then gener- 
alized to the matrix S = diag(Sij (zi), Sy (^2), ...), where 



each Sij{zk) has the dimension ric 



X Uc 



Similarly, m is 



generalized as m = (m(zi), m(z2), ■■■), with each m(2fc) 
being a Uc component vector. For future reference, we 
further define M = diag(m) and C = M + S; C is the 
covariance matrix in the limit of large cluster numbers 
in a cell (m^ ^ 1; see iLima fc Hul[200l . 

Constraints on dark energy parameters are extracted 
from the likelihood function that involves the counts- 
in-cells data, the theoretical mean abundance, and the 
theoretical sample variance. For theoretical forecasts, 
the Fisher matrix — the expectation value of the second 
derivative of the minus log-likelihood function — is often 
applied. For a combination of the Poisson shot noise and 
the Gaussian sam ple variance, the Fisher matrix reads 
(ILima fc Hull2n04f) 



1 



(12) 



where the comma and subscript a indicates the partial 
derivative with respect to model parameter Oa- The 
Fisher matrix approach essentially approximates the like- 
lihood function as a Gaussian distribution near its max- 
imum likelihood point, and the curvature at this point is 
related to the constraints on the model parameters. The 
covariance matrix for model parameters is approximated 
by the inverse of the Fisher matrix. This basic picture 
will play a key role in t j3.31 where we modify the Fisher 
matrix formalism for assessing the systematic errors. 



3.2. Incorporating Assembly Bias into Self- Calibration 

We now incorporate assembly bias into the self- 
calibration formalism. The formalism we outline below is 
relevant for any secondary parameter which both affects 
the halo bias and correlates with the cluster mass proxy. 
We specifically consider the secondary parameter to be 
the halo concentration c and refer to this dependence 
throughout as "assembly bias." Note that although the 
halo concentration and assembly history are generally 
expected to be tight ly correlated (jNavarro et al.l [l997l : 
IWechsler et al.l l2002[ ). they ma y not have exactly the 
same effect on halo bias (see e.g. iGao fc Whiti[2007l) . 

Let b^^{M, c) be the halo assembly bias, which now de- 
pends on both mass and concentration, and let f(c\M) 
be the distribution of concentrations for halos of mass 
M. In this case, the observable-mass distribution 
P{Mohs\M) needs to be generalized to an observable- 
mass-concentration distribution P{Mohs\M,c). With 
the secondary parameter c, the mean abundance fh takes 
the form 



m — V 



JdM dcf{c\MmM,c) 



where 



Af,c} = / dAfobs P(Afobs|Af,c)(/)(A^obs) 



(13) 



(14) 



This mean abundance remains the same as equation [9] 
since the concentration dependence only affects the halo 
bias but not the mass function. We thus require 



dc/(c|Af)(0|Af,c) = (0|A/) 



(15) 



On the other hand, the bias integrated over the observ- 
able range is affected, and the analog of Equation [TT] is 



foab ^ ^ /■ / dc b-\M,c)f{c\M){(t,\M,c) 

m / dM 



The corresponding sample variance in this case reads 



(16) 



cab -2 fiah \2 2 

Si. [b ) cr^. 



(17) 



and we analogously define C^*^ = M -I- S^*^. Replacing 
the corresponding matrices in Equation ll21 we obtain the 
Fisher matrix incorporating assembly bias. 

The difference between P{Mohs\M,c) and P{Mohs\M) 
depends on how A^obs correlates with c. We leave 
these details to §4.11 and simply state here that our 
parametrization depends on the cross-correlation coef- 
ficient r relating Afobs and c at fixed halo mass. When 
r = 0, assembly bias has no impact on self-calibration; 
when r — ±1, the impact of assembly bias is maxi- 
mized. Figure [2] demonstrates the formalism described 
above (with an SPT survey assumption and a WMAP3 
cosmology, see S]) and shows how the correlation be- 
tween Afobs and c changes the constraints on dark energy 
parameters, assuming that we have thorough knowledge 
of assembly bias and that r = ±1 {dotted and dashed 
curves). As can be seen, correlation between Afobs and 
c actually improves the dark energy constraints if r is 
known a priori. This improvement is presumably due 
the dependence of bias on Af*, which is also sensitive 
to dark energy, although we also note that the assump- 
tion of self-similarity in M/M^, needs to be assessed in 
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Fig. 2. — Improvement of dark energy constraints assuming a 
thorough modeling of assembly bias and knowledge of the cross- 
correlation relating M^^^^ — the cluster's mass estimate based on 
a cluster observable — and c, the halo's concentration parameter. 
All error ellipses include the 68% confidence regions in the fiDE^f 
plane. The solid ellipse shows the fiducial model of zero observable- 
concentration correlation (r = 0), in which case assembly bias 
has no effect. The dotted/dashed ellipse corresponds to an ob- 
servable which is perfectly correlated/anti-correlated with concen- 
tration (r = f/— 1). If assembly bias is correctly modeled, the 
sensitivity of assembly bias to Af, slightly improves dark energy 
constraints. 



the dark energy-dominated regime. In addition, with 
the knowledge of r, the scatter in Mobs actually contains 
the information of halo concentration, which may also 
improve cosmological constraints. These extreme cases 
are mainly for demonstration, since we are unlikely to 
have sufficient astrophysical knowledge to specify both 
the assembly bias and this correlation. However, if indi- 
vidual concentrations can be measured for the most mas- 
sive clusters (where the impact of assembly bias is most 
severe), they could provide observational evidence of as- 
sembly bias and increase the efficacy of self-calibration. 

In the following sections, we explore the question: if 
one were to perform the self-calibration analysis ignoring 
the effects of assembly bias (effectively, assuming r = 0) , 
how would the estimated cosmological parameters be bi- 
ased? As we shall see, the answer sensitively depends on 
r and on the scatter in the observable-mass distribution. 
We next include r as a free parameter in the Fisher ma- 
trix analysis and consider the effect of marginalization 
over r. However, a caveat for applying the Fisher matrix 
here is that since r is bound to the range [—1,1], the like- 
lihood function for r may not be well-approximated as 
Gaussian if r is close to ±1. Because the Fisher matrix is 
based on this Gaussian approximation, it may not apply 
to the case when r approaches ±1. On the other hand, 
our fiducial choices of this parameter, which are in the 
range \r\ < 0.5, may circumvent this problem. 

3.3. Biased Parameter Estimation from Ignored 
Systematics: A Modified Fisher Matrix Formalism 

In ij^l we described how ignoring the impact of as- 
sembly bias can potentially lead to biased cosmological 



parameter estimates. In this section, we modify Fisher 
matrix formalism to quantitatively assess the significance 
of this systematic. We focus on how the parameter es- 
timates are biased due to a wrong model assumption, 
and how significant this systematic error is when com- 
pared with statistical uncertainties. This formalism is 
motivated b y the standard Fisher matrix formalism as 
presented in lTegmark et al.l (|1997D . 

We generally consider two models, denoted by model 
A and model B, each of which describes a data set x 
based on a parameter 6. Here 9 can be generalized to 
a vector denoting a set of parameters {9i values). We 
assume that the observed data set x is well described by 
model B but is mistakenly analyzed according to model 
A. If 9t denotes the true parameter in model B that 
corresponds to the observed data set x, we are interested 
in how the estimated parameter 9 recovered based on 
model A differs from 9t. Our quantitative analysis can 
be summarized as follows: 

1. Our starting point is the likelihood function 
La{x\9) for model A. The data set x is as- 
sumed to be drawn from the probability distribu- 
tion PB{x\9t) for model B; in order to relate 9 to 
9t, we take average over x to compute (in LA{9)\9t). 

2. We take the point 9 which maximizes {\nLA{9)\9t) 
as our estimator for the recovered cosmology. This 
step defines the function 9{9t), the recovered model 
parameter varying with the input parameter 9t . We 
are particularly interested in S9 = 9{9t) —9t, which 
is the systematic error in parameter inference due 
to assuming an incorrect model.* 

3. In order to assess the significance of the system- 
atic error 69, we compare it against the statistical 
uncertainty in 9. We calculate the modified Fisher 
matrix F^j{9t) = {d^{-\nLA)/d9^d9j\9t) and ob- 
tain the corresponding error bar ag, — (F^^)^^. 
The systematic error is significant if d9i > ag^ . 

A detailed derivation when both Pa{x\9) and Pb{x\9) 
are Gaussian can be found in the Appendix. 

In this study, model A represents the standard self- 
calibration analysis that ignores assembly bias, while 
model B is self-calibration analysis that includes the im- 
pact of assembly bias. The data set x is the number 
counts in each of the cells under consideration. The sys- 
tematic errors of the recovered parameters are given by 

69, = ^(F-i),,Tr{ic-iC,,C-i(C^'^ - C)} , (18) 

i 

where C^*^ is the covariance matrices with assembly bias, 
and C and F are the same as those in equation [121 The 
modified Fisher matrix reads 

/;,=rT.-C-m, + llr[C-S,.C-S,C-C-], (19) 

in which the modification comes from the change of co- 
variance matrix due to assembly bias (see the Appendix). 

* An alternative approach is to first use Lji{x\9) to compute the 
maximum likelihood estimator 8{x). Since 6 is now a function of 
the data x, one could use Ps(a?|9t) to compute the expectation 
value {8\0t). However, this approach is not analytically tractable. 
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We note that similar formalisms arising from differ - 
ent approaches can be f o und in e.g. iKnox et alJ (Il998h , 
Huterer fc Turnei^ ("200 j^ lHuterer &: Lindeil (|2007t) . and 
Amara fc R cfrcgicr (2003). 

Figure [3] illustrates the results of our formalism as 
applied to the self-calibration analysis for an SPT-like 
survey in the specified WMAP3 cosmology (see g] for 
details of implementation and assumptions). In each 
panel, the open circles indicate the assumed true val- 
ues, while the filled circles show the recovered parame- 
ters from a self-calibration analysis that ignores assem- 
bly bias. The ellipses include the 68% confidence regions 
in the flDE~w plane; the dashed ellipses correspond to 
correctly-modeled assembly bias (assuming that we know 
the correlation coefhcient r a priori; r will be mathemat- 
ically defined in i i4.ip , while the solid ellipses correspond 
to the ignored assembly bias. Note that the shape of 
the confidence regions can also be changed by this sys- 
tematic. The left panel shows the assumption of a small 
Mohs-M scatter and low Mobs-c correlation (uinM = 0.1 
and r = —0.5), and the systematic errors are 0.22 and 
0.23(T for f2DE and respectively; the deviations of the 
parameter estimates are much less than the statistical 
uncertainties. The right panel shows the assumption of 
a larger scatter and perfectly anti-correlated Mobs and 
c (cinM — 0.25 and r = —1), and the resulting system- 
atic errors are 1.14 and 1.2cr for Ode and w, respec- 
tively; these deviations are significant and cannot be ig- 
nored. We thus expect the impact of assembly bias will 
be stronger if the observable-mass relation has a large 
scatter and if Afobs is strongly correlated with c. The 
exact dependence of systematic error on these two quan- 
tities will be fully explored in iJ51 

4. IMPLEMENTATION 

4.1. Parameterizing the Observable-Concentration 
Correlation 

In the absence of assembly bias, we follow UTima fc Hul 
|2005) to parameterize the observable-mass relation 
P(lnMobs|M). Given halo mass AI, the corresponding 
log observables In Mobs are modeled as a Gaussian dis- 
tribution with mean In M + In Mbias — where Mbias spec- 
ifies the offset between the estimate mass and the true 
mass — and variance crj^^M- This parameterization serves 
as the standard case as we generalize /'(In Mobs |M) to 
P(lnMobs|M, c) for analyzing the effect of assembly bias. 

A priori, we do not know exactly how the estimated 
mass of a cluster Mobs will depend on the cluster's con- 
centration c, that is, the correct parameterization for 
P(lnMobs|M, c). In detail, this relation may depend on 
both physical and observational effects. However, we 
would like to demand a simple wish-list of properties of 
our parameterization: 

1. When marginalized over concentration, 
P(ln Mobs|M, c) should reduce to the Gaus- 
sian distribution P(lnAfobs|M) of the fiducial case 
(as required by eq. [15]), independent of any new 
parameters introduced (i.e. we should keep the 
total In Afobs-ln M scatter fixed) . 

2. In order to study how self-calibration is affected 
as the dependence of A/obs on c is "turned on," the 
parameterization should have a tunable parameter. 



TABLE 1 
Survey Assumptions 



Survey 




Bin Size 




Area 


■2m ax 




(h-iMe) 


(AlogioMobs) 




(deg2) 




SDSS (optical) 


1013.5 


0.5 


3 


7500 


0.3 


DES (optical) 


1013.5 


0.5 


3 


5000 


1 


SPT (SZ) 


1014.2 


1 


1 


4000 


2 


LSST (optical) 


1013.5 


0.5 


3 


20000 


2 



Note. — All surveys use cells of area 10 deg 



lid Az 



When this tunable parameter is set to zero, our 
analysis should reduce to the standard case. 

In the interest of simplicity, we take P(ln Afobs|M, c) to 
be Gaussian in In Mobs, and assume that the halo con- 
centration slightly shifts In Mobs relative to In M, so that 
the mean and the variance of In Mobs are given by 

(In AfobsiM, c') =ln Af + In Afbias + rai^uc' (20) 
Var(ln AUs | Af, c) = <jl^{l-r^) . (21) 

In the above expressions, r is the correlation coefficient 
between In Mobs and c' at fixed InM, crinM is the scatter 
in InAfobs at fixed M, and c' is defined via 



Inc - (lnc|A.f) 
v/Var(lnc|A/) 



(22) 



Note that when r = 0, all of the observed scatter in 
In A/obs at fixed In M is intrinsic, and our model reduces 
to the standard case. Conversely, for r = 1, the scat- 
ter in In Mobs at fixed In M is entirely due to the scatter 
in halo concentration at fixed mass. As a consistency 
check, we find that if we marginalize P(ln Aiobs|Af, c) 
over concentration (assuming a log-norrnal distribution 
for c at fi xed mass, see e. g. [Jing[2000l iBuUock et all 
120011 and iNeto et al.1 120071) . the resulting distribution 
P(lnMobs|M) is exactly that of the standard case; that 
is, our parameterization preserves the total scatter in 
In Mobs at a given InAf. 

4.2. Survey Assumptions, Cosmological Models, and 
Nuisance Parameters 

With the Fisher matrix analysis, we statistically fore- 
cast the systematic effects for four galaxy cluster sur- 
veys: the Sloan Digital Sky Survey (SDSS, lYork et all 
[2000t assuming the volume using photometric data), 
the Dark Energy Survey (DES^), the South Pole Tele- 
scope (SPT^), and the Large Synoptic Survey Tele- 
scope (LSST'). The survey areas are assumed to be 
7500 deg^ for SDSS, 5000 deg^ for DES, 4000 deg^ for 
SPT, and 20000 deg^ for LSST, with survey depths of 
^max = 0.3,1.0,2.0 and 2.0 respectively. The cells used 
for the counts-in-cells analysis are assumed to have an 
area 10 deg^ and redshift interval Az = 0.1. We as- 
sume clusters with Mobs > lO^'''^ h~-^MQ are observed 
by SPT, and perform no mass binning. For SDSS, DES, 



^ See http://www.darkenergysurvey.org/ 
^ See http: / /pole. uchicago .edu/] 
^ See http://www.lsst. org^ 
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Fig. 3. — Systematic errors d ue to ignoring existent assembly bias. Here we assume two sets of scatter and correlation values and 
perform the analysis discussed in i|3.3l with an SPT survey assumption and a WMAP3 cosmology (see ijlj. The open circles and dashed 
ellipses show the true parameter values and the 68% confidence regions with assembly bias correctly included. The solid circles and the 
solid ellipses show the estimated values and 68% confidence regions if assembly bias is completely ignored. The left panel shows that for 
a moderate assumption of o"inM = 0.1 and r = —0.5, the systematic errors are 0.22 and 0.23(t for SIde a-nd w, respectively; in this case 
the effects of assembly bias are ignorable. On the other hand, the right panel shows that for an extreme assumption of ai^M = 0.25 and 
r = —1, the systematic errors are 1.14 and 1.2ct for Hde and w, respectively; in this case the effects of assembly bias are significant. 



and LSST, the observational threshold is assumed to be 
-^^obs > lO^'^ '^ h~^MQ, and the counts in each of these 
surveys are binned in three observable bins. The survey 
parameters for all four surveys are detailed in Table [T] 

While the mass threshold of SZ observatio ns has lit- 
tle redshift dependence (e.g. ICarlstrom et al.l l2002). the 
mass threshold of optical surveys has more uncertainties. 
Clusters with mass 10^^'^ h~^MQ have been shown to 
be detectable, with high purity and completeness, with 
more than 10 bright red galaxies OAL^) in the SPSS 
photometric survey o ut to z ~ 0.3 (|Koester et al.l 120071 : 
I Johnston et al.|[2007D . We note that our choice of the 
minimum mass for the optical surveys assumes that such 
clusters can still be detected with high purity and com- 
pleteness out to the maximum redshift Zmax- This as- 
sumption may be reasonable out to z — 1, where clusters 
have been shown to have a robust red sequence, but the 
efficacy of this method will eventually break down at 
higher redshifts. In any case, it will need to be tested in 
detail with both realistic simulations and the data itself. 
We note that for LSST, one may wish to detect clusters 
using peaks in the lensing shear instead of from assump- 
tions about the galaxy distribution (e.g. iKaiseil Il995t 
iHennawi fc Spergell2005( l. in which case self-calibration 
could serve as a consistency check for the predictions for 
the observed shear signal made directly from simulations. 
In fj5l we consider one example case for LSST, which has 
similar assumptions to the lower z^ax optical surveys, for 
reference. 

In this work, we consider two sets of cosmological pa- 
ramcter s, narnely t he best fit cos mologies to WMAPl 
(ISpergel erall[200l and WMAP3 (|Spergel et all 120071) . 
whose parameter values are listed in Table [2l Both of 
them are flat ACDM cosmologies but differ mainly in 
the relative contribution of dark energy to the global 



energy density, in the normalization of fluctuations (5<^ 
or cTg), and in the spectral index (n). The impact of 
these differences on our analysis will be presented in 
^ In our statistical forecast, we do not put any pri- 
ors on dark energy parameters, but we assume Planck- 
like priors on the rest of the cosmological parameters 
(see Table [2]). Finally, i n our forecast mode ls we use 
the halo ma ss function b y lJenkins et al.l (|2001h . the bias 
function by ISheth etHl ()2001l ). and the assembly bias 
b''^{M,c) found by IWechsler et all ()2006l . this assump- 
tion was shown in Fig. 1). 

With regard to the observable-mass relation, our 
model involves three nuisance parameters: the bias in 
the estimated mass (InMbias), the scatter of InAfobs 
given InM (crinivi), and the cross-correlation coefficient 
between In Mobs and the normalized halo concentration 
c' (r). Throughout, we take InMbias = as our fiducial 
model. Our choice for the fiducial values for the scatter 
and the cross-correlation coefficient in each of the surveys 
requires further discussion. 

Let us first focus on the scatter. For an SPT- like sur- 
vey, the observational mass proxy is the SZ decrement of 
the cosmic microwave background due to the hot, ionized 
gas permeating the inter-cluster medium. At present, 
this scatter has only been predicted from numerical sim- 
ulations but has no t been determined from observations. 
IWhite et aP ()2002l ) argued that the three main sources of 
scatter are the evolution of the M-T relation, asphericity 
in the matter dis trib ution, and lin e-of-sight projection. 
iMotl et all (|2005f ) and'Nagail (|2006D showed that the scat- 
ter is 10%-15%, and the scaling relation is insensitive to 
the detailed physical processes involved in galaxy for- 
mation, with a good agreem ent with self-similar models. 
However, I Shaw et al.l ( 2007t l showed that at least 20% 
intrinsic scatter exists due to the internal properties of 
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TABLE 2 
Fiducial Cosmologies 



Cosmology 




to 


<5j(fc = 0.05Mpc-i) 


n 




ffi^ h 


WMAPl 


0.73 


-1 


5.07 X 10"^ 


1 


0.024 


0.14 


WMAP3 


0.76 


-1 


4.53 X 10"^ 


0.958 


0.0223 


0.128 



Note. — All of our forecasts assume Plank-like priors: (T(ln(^) — 
a(lnQrnh'^) — tT(lnOb/i^) — cr(n) — 0.01, except for Ode and w. 



galaxy clusters. They also demonstrated that this scatter 
could be reduced by choosing different aperture radius 
for defining Af and Y, or by removing cluster samples 
with many substructures. Moreover, it may be possible 
to reduce the scatter even fu rther using clus ter struc- 
tural properties. For example. lAfshordil (|2007f) proposed 
a "fundamental plane" among the cluster mass, the total 
SZ flux, and the SZ half-light radius i?sz,2; in simula- 
tions, this relation r educed the scatter in ma ss estimates 
to ^ 14%. Further, iHaugboelle et alJ (|2007l ) found that 
by constructing an empirical model for the SZ profile, 
which includes a scaling parameter rg , they could reduce 
the scatter down to 4%. In this work, we take the largest 
of these range of values, namely 20%, as our fiducial scat- 
ter for SPT. If SPT is insensitive to halo assembly bias 
for this largest possible scatter, then it will also be in- 
sensitive for smaller values of scatter. 

In optical surveys, the usual observational mass proxy 
is the optical richness, namely the galaxy number in 
a galaxy cluster. Other choices are also possible, in- 
cluding the total optical luminosity or combinations of 
parameters (e.g. iRvkoff et all [2008H Becker et al] 120071: 
iReves et al.l I2008D . Determining a reasonable choice 
for the scatter for a DES-like survey is somewhat less 
straightforward, as predictions from simulations are less 
robust and the scatter can highly depe nd on both the 
richne ss measure and the cluster finder. iGladders et al.l 
()2007t l applied a self-calibration analysis to a catalog 
from the Red-Sequence Cluster Surveys (RCS), finding 
the fractional scatter fsc to be 0.69 ± 0.20 and 0.7lto;i7 
(based on different priors). Using t he velocity informa - 
tion of galaxies in maxBCG clusters. iBecker et al.l ()2007l ) 
estimated that the optical richness had a mass dependent 
scatter which varied from about 0.75 for massive clusters 
to 1.2 for group scale objects. Cross-correlation with 
the X-ray data on these same clusters suggests a con- 
siderably smaller scatter of about 0.5 (E. Rozo et al. in 
preparation). Here, we choose 0.5 as our fiducial scatter 
value for two reasons: First, as we shall see, even with 
this amount of scatter, halo assembly bias has a signifi- 
cant impact on self-calibration studies; this small scatter 
thus provides a baseline value for the impact of assem- 
bly bias. Second, we note that current optical richness 
estimates have all used fairly crude measures of richness. 
We think it is highly probable that in the near future we 
will start seeing richness measures that are considerably 
more strongly correlated with mass than those used at 
present. Thus, we have opted to select a scatter value 
that is closer to the lowest scatter estimated in current 
samples. 

We now turn to the correlation relating observable and 



concentration, r. Currently, there are no observational 
constraints on r for either optical or SZ mass proxies. 
We also find no quoted values for the correlation between 
SZ and halo concentration in the liter ature, although we 
note that lReid fc Spergell (|2006D and lShaw et~al] ()2007f ) 
investigated the impact of halo concentration on the scat- 
ter in Ysz and found that a considerable fraction of the 
scatter in SZ is due to variations in halo concentration at 
fixed mass. In this work, we choose the fiducial value to 
be r = 0.4, which is the value observed in simulations of 
clusters using the hydrodynamical ART code (D. Rudd 
2008, private communication). 

The Mobs~c correlation r for optical clusters is some- 
what better understood. We are not aware of current 
observational constraints on the correlation between op- 
tical cluster richness and halo concentrations, although 
with a sufficiently large sample, this value could in princi - 
ple be me asured from lens ing data. IZentner et al.l (|2005l ) 
and Wech sler et al.l (j2006f ) have shown that the amount 
of substructures in a cluster-size halo is negatively corre- 
lated with halo concentration. On the other hand, selec- 
tion effects could modify this correlation. For instance, 
high concentration halos might, on average, be assigned 
higher richness than low concentration halos of the same 
mass due to the larger galaxy density near the cluster 
core. In this work, we choose r = —0.5 as our fiducial 
value, which is roughl y cons istent with the numerical re - 
sults of IZentner etall (|2005[ ) and lWechsler et all (|2006f l. 
We note that the results presented here assume that all 
of these parameters are constant with both redshift and 
mass. 

5. RESULTS AND DISCUSSION 

We now present the effect of assembly bias for a set of 
specific assumptions about galaxy cluster surveys. We 
focus on the systematic errors in the two dark energy 
parameters JIde and w when compared with the sta- 
tistical errors expected from each survey, \Sfl-DE\/o'nr,E 
and \6w\/aw Figure [4] shows how these two ratios vary 
with the scatter in lnMobs~lnAf (crinM) and the cross- 
correlations coefficient of InMobs-c' (r) for our fidu- 
cial SDSS, DES, and SPT surveys. AU plots assume 
a WMAP3 cosmology. As can be seen, a high degree of 
correlation and/or large scatter can result in significantly 
biased cosmological estimates for both DES and SPT, 
while for the current SDSS the statistical uncertainty is 
sufficiently large that halo assembly bias is insignificant. 

How halo assembly bias differently affects DES and 
SPT is worth discussing. For a fixed scatter and corre- 
lation coefficient, the cosmological constraints from DES 
are considerably less biased than those of SPT. The rea- 
son for this difference is two-fold. First, DES clusters 
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Fig. 4. — Systematic errors for Que ('eft) and w (right) estimators, as a function of scatter in the observable given mass (cri^ui) 
and the correlation between observable and halo concentration (r). The ratios of the systematic error and the statistical uncertainty 
{\S6\/ag) are shown for three of our main survey assumptions: SDSS, DES, and SPT, from top to bottom. High scatter value and strong 
corrclation/anti-correlation correspond to high deviation of estimators. We also mark the fiducial values of ci^jyi and r in each panel 
according to our current knowledge from observations and numerical simulations. We note that these plots are also applicable to other 
surveys (e.g. X-ray) for which the survey volume and mass threshold are the same as those assumed here. See |l5]for discussion. 
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probe a lower mass scale than SPT clusters do. This 
difference is important because the effect of concentra- 
tion on halo bias is important for high mass halos, but 
non-existent for halos of mass near ~ 10^^ Mq 
(jWechsler et alJ l2006l l. Consequently, the cosmologi- 
cal constraints coming from low mass clusters (groups) 
should be unbiased.* The second important difference 
between SPT and DES is that in our fiducial surveys we 
have assumed binned counts for DES clusters but only 
thresholded counts for SPT clusters. Consequently, all 
the cosmological information provided by the shape of 
the halo mass function (which is unaffected by assembly 
bias) does not contribute to the SPT constraints. Thus, 
SPT constraints are considerably more sensitive to the 
effects of assembly bias than the DES constraints given 
the same scatter and correlation coefficient. 

That is not, however, the end of the story. In order 
to fairly compare SPT to DES, one also needs to con- 
sider the regions of parameter space relevant to each of 
these surveys. We noted earlier that numerical simula- 
tions predict that the intrinsic scatter in the SZ signal 
is approxini a tely 20% or even l ess (e.g.lMotl et al.ll2005t 
iNagal 120061 : IShaw et all 120071 : IHaueboelle et all 120071) . 
As can be seen from the bottom panels of Figure [H 
these scatter values do not result in significant biasing of 
the recovered cosmological parameters for any value of 
r. Although the expected intrinsic scatter is small, the 
projection effec t may raise or even dominate the tota l 
scatter (see e . gJWhite et al.ll20Q2l: iHaliman et al.ll2007l) . 
Holder et al.l |2007[ ) found that the SZ background can 
generate errors larger than 20% in recovered fiux if as 
is near 0.7. If the extra scatter is due to the randomly 
aligned structures along the line of sight, then we do not 
expect assembly bias to have a significant impact. On 
the other hand, if the projection effect is dominated by 
nearby structures, the extra scatter due to projection will 
be strongly correlated with the environment, resulting 
in higher correlation between concentration and observ- 
able. In fact^ if the scatter due to projection is as high as 
[Holder et al.l ()2007[ ) predicted and is also dominated by 
nearby correlated structure, the effect of assembly bias 
may be strengthened. 

Photometric surveys like DES, in contrast, are very 
likely to be sensitive to the impact of assembly bias. In 
this case, we know that the opti cal richness-mass re- 
lation has a scatt er > 50% (e.g. iGladders et al.l [20071 : 
iBecker et al.ll2007L E.Rozo et al. in preparation). As can 
be seen in the middle panels of Figure [H even moder- 
ate correlations between Mobs and c, e.g. \r\ > 0.5, can 
result in significant biasing of the recovered cosmologi- 
cal parameters. It is likely, therefore, that cosmological 
analysis of the DES optical cluster sample will need to 
include halo assembly bias in order to avoid systematic 
errors in dark energy inference, unless the analysis can be 
done with an observable that is more tightly correlated 
with mass. On the other hand, DES will have additional 
mass measurements, including its weak lensing and SZ 
signals from SPT. If these measurements are included in 
the analysis, the effect of assembly bias may be dimin- 

* Although the low mass clusters (groups) are less affected by 
assembly bias, they are subjected to more statistical errors. For 
the most constraining power, the choice of mass threshold should 
be made based on the scatter, the completeness, and the purity. 



ished. Observational inference of assembly bias may even 
be possible with mass profile measurements. 

We note that in these figures, the only differences are 
between the survey volumes and the threshold in Mobs, 
without assuming any other information specific to the 
optical or SZ surveys. Therefore, these results are appli- 
cable to other surveys (e.g. X-ray surveys) with the same 
survey conditions. 

In Figure [5] we (1) explore the systematic effects due 
to assembly bias under different cosmological parame- 
ters (WMAPl and WMAP3) and (2) extend the calcu- 
lation to include an assumption for an LSST-like opti- 
cal cluster survey. (For SDSS, DES, LSST: r = -0.5 
and CTinM = 0.5. For SPT: r = 0.4 and ainM = 0.2.) 
First, the most relevant difference between WMAPl and 
WMAP3 is that WMAP3 has higher f^DE and lower nor- 
malization (5^ or as values, as listed in Table [H As a 
result, the WMAP3 cosmology has fewer clusters, and 
the sample variance of the clusters is smaller. These dif- 
fere nces inc rease the statistical errors of the surveys (see 
also lLima~& : Hu 2007.), thus making the impact of assem- 
bly bias less significant in the WMAP3 cosmology than 
in the WMAPl cosmology. Overall, our main conclu- 
sions remain unchanged. Second, for LSST, we find the 
systematic due to assembly bias is very significant for our 
fiducial values; this systematic is likely to be significant 
for even small values of ctihM and r. 

We especially note that the systematic of assembly bias 
impacts r^DE and w differently. From the survey point 
of view, increasing Zmax above 1 largely improves the 
constraints on w, but barely improves the constraints on 
r^DE- For w, the systematic error due to assembly bias 
increases monotonically with Zmax, while for I^de, this 
systematic error somewhat cancels and then changes its 
sign as Zmax increases. This difference is due to the fact 
that r^DE and w affect the observed large-scale structure 
differently in different regimes. Dark energy affects the 
observed large-scale structure through two mechanisms: 
the growth function and the comoving volume. High fioE 
and high w both result in stronger suppression of struc- 
ture. On the other hand, the volume dependence works 
differently: high fioE and low w correspond to larger vol- 
umes. The effects work in opposite directions for JIde! 
higher f^DE leads to less structure but more volume. Be- 
fore the onset of dark energy domination, the comoving 
volume effect dominates; after dark energy take-over, the 
growth function effect dominates. Thus near the onset 
of dark energy domination, the observed structure is in- 
sensitive to f^DE, leading to no extra information from 
this regime. The effects of assembly bias on fioE before 
and after the dark energy domination have opposite signs 
and thus cancel each other. On the other hand, for w, 
both effects work in the same direction; thus, including 
more survey volume will always increase the amount of 
information on w, and the systematic effects do not can- 
cel. That is why the systematic effect of assembly bias 
on w increases monotonically with Zmax- 

Another interesting question is how the constraints 
on cosmological parameters are degraded if we include 
r as an additional nuisance parameter that needs to be 
marginalized over. However, as we mentioned earlier, the 
fact that the likelihood function is non-Gaussian in r if 
r is close to ±1 implies that the Fisher matrix estimates 
may not apply. Therefore, the following constraints with 
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Fig. 5. — Impact of assembly bias for two different cosmolo- 
gies and four survey conditions. The ratio \59\/ag for SIde and 
w are plotted as circles and squares respectively. Solid and open 
symbols are for WMAPl and WMAP3 cosmologies respectively. 
DES and LSST are clearly sensitive to assembly bias, while SPT 
is marginally sensitive to it, with the effect being stronger for 
WMAPl than WMAP3. A current SDSS-like survey is not sensi- 
tive to assembly bias. (Fiducial values assumed for other parame- 
ters include: r = -0.5 and ai^M = 0.5 for SDSS, DES, and LSST, 
and r = 0.4 and trinM = 0.2 for SPT) 



marginalization over r are only to be taken as rough in- 
dicators. 

We use moderate values for r (0.4 for SZ and —0.5 for 
optical) to compare the cosmological constraints assum- 
ing (1) fixed r values, and (2) r to be a free parameter in 
the Fisher matrix. Table [3] contains our results for three 
of the survey assumptions. As can be seen, while the er- 
ror bars for DES are only slightly affected by marginal- 
ization over r, those for SPT increase by a factor of 2 to 3. 
The reason is again related to the mass binning; since our 
fiducial SPT survey does not include mass binning, there 
is no information about the shape of the halo mass func- 
tion, which, if present, can improve the constraints on the 
scatter in the observable-mass relation. In the absence 
of this shape information, the constraints on the scatter 
is modest, which means that marginalization of fioE and 
w over the acceptable region of the parameter space will 
reach areas with very large scatter. Since those areas are 
highly sensitive to the effects of halo assembly bias, the 
marginalized errors will be significantly larger. In the last 
row of Table [3] (SPT5), we assume five narrow observ- 
able bins for SPT with bin size AlogpAfobs = 0.2. In this 
case, the dark energy constraints are barely degraded af- 
ter marginalizing over r. Thus, mass binning is a crucial 
component of the data analysis for both DES and SPT 
to maximize their potential as cosmological probes. Note 
that, in all cases, r itself cannot be well-constrained like 
other nuisance parameters. Since the dependence on r 
only affects the sample variance but not the abundance, 
the information for constraining r is insufficient. 

6. SUMMARY 

Self-calibration analysis in galaxy cluster surveys relies 
on the dependence of the halo bias on mass to simulta- 



neously constrain cosmology and the cluster observable- 
mass distribution. Recent work has shown that halo 
bias is sensitive not only to halo mass, but also to sec- 
ondary parameters related to the assembly history. Here 
we consider the effect of halo concentration on the bias 
as a specific case of the secondary parameters (gener- 
ally termed assembly bias) , and show how it might affect 
self-calibration analyses. In particular, if halo selection 
depends on halo concentration, the observed clustering 
amplitude of the corresponding cluster sample will devi- 
ate from that of a random selection of clusters with the 
same mass distribution. This deviation in the observed 
clustering amplitude can result in biased inferences of 
cosmological parameters, depending on (1) the amount 
of scatter between halo mass and the observational mass 
proxy, and (2) the correlation between the mass proxy 
and halo concentration. For current surveys like SDSS, 
the statistical uncertainty is still sufficiently large that 
the systematic error due to assembly bias is negligible. 
On the other hand, for an SPT-like survey, the expected 
small amount of intrinsic scatter between the SZ decre- 
ment and halo mass suggests that the impact of assem- 
bly bias on parameter estimation is negligible; however, 
if the projection effect results in higher scatter in high 
density regions, assembly bias may have significant im- 
pact. For a DES-like survey, where the mass proxy is 
likely to have considerably larger scatter, we estimate 
that assembly bias can displace the recovered dark en- 
ergy parameters from their true values by about la. For 
an LSST-like survey, this systematic error can exceed 2a 
in w. In the last two cases, halo assembly bias may need 
to be explicitly included in the cosmological analysis to 
avoid biasing of the recovered dark energy parameters. 
We emphasize, however, that our analysis has assumed 
the specific dependence of ha l o bia s on halo concentra- 
tion found by Wcchslcr et al.l (j2006l ) . If this dependence 
is shown to be smaller at high masses, if the correlation 
relating the observable mass proxy and halo concentra- 
tion can be shown to be small, or if observables that are 
more tightly correlated with mass can be found, the effect 
will be mitigated. We have shown that binning in mass 
is crucial for both optical and SZ surveys, as marginal- 
ization over this correlation coefficient can increase the 
expected errors of dark energy parameters by a factor of 
a few if we only use thresholded counts. 
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TABLE 3 
Self-Calibration Constraints. 



Self-Calibration with Fixed r Self-Calibration Marginalized over r 



Survey 


f^DE 


w 


in Afbias 


2 

""inM 


f^DE 


w 




2 

""in M 


r 


SDSS 


0.066 


0.240 


0.411 


0.086 


0.074 


0.251 


0.460 


0.108 


0.294 


DES 


0.006 


0.045 


0.051 


0.022 


0.006 


0.047 


0.053 


0.025 


0.125 


SPT 


0.010 


0.076 


0.104 


0.028 


0.025 


0.177 


0.355 


0.149 


1.300 


SPT5 


0.009 


0.061 


0.079 


0.017 


0.010 


0.062 


0.087 


0.027 


0.357 



Note. — Cosmological constraints with fixed cross-correlation coefficient r, and with marginal- 
ized r. We assume a WMAP3 cosmology, and the nuisance parameters are the same as those in 
Fig. [5] After marginalization over r, the constraints from binned cluster samples (SDSS, DES, and 
SPT5) arc barely degraded, while the constraints from thrcsholded samples (SPT) are degrade by 
a factor of 2 to 3. This result demonstrates the importance of mass binning. In all cases, r cannot 
be well-constrained like other nuisance parameters since it only affects the sample variance but not 
the abundance. We emphasize, however, that the second part of this table are to be interpreted 
as rough indicators, since the likelihood function may not be Gaussian in r. 



APPENDIX 

BIASED PARAMETER ESTIMATION FROM INCORRECT MODELS 

In this section, we explicitly implement the modified Fisher matrix formalism developed in ^3.31 for the case in which 
both Pa(x\Q) and PB(x\ff) are Gaussian. Let /i(6') and C(0) be the mean and covariance matrix defining Pa(x\&) in 
model A, which is related to the likelihood function; let and be the corresponding quantities in model 

i?, which represent the observed data. Note that and C(0) contain the model parameter Q that we are trying 
to fit, while and contain the true parameter value Of The log-likelihood function of model A reads (up to a 
constant) 

2£ = -21ni(f|6l) =lndetC + (f-^)^C"i(f-/i) . (Al) 

Taking the derivative with respect to Q and averaging over x, the maximum likelihood estimator Q can be found by 
solving 

(2£ ,) = Tr[C-iC,,;(l - C-i(D)) + (D,,;)] = , (A2) 

where (D) = -I- {JP — fl){fl^ — fl)'^ and (D.i) = — jT)^ . We then set 9 = 9f + S9, linearize this equation 

with respect to S9, and solve for SO. 

To proceed further, we focus on two simple examples of interest. The first example is the effect of assembly bias; 
model A corresponds the standard self-calibration, while model B corresponds to self-calibration with assembly bias. In 
this case, model B changes the sample variance but not the mean; thus jl{0) — j2^{9) for all 9 values, but C{9) ^ 0^(6*). 
After linearizing with respect to SO, the linear equations for 69 read 

Tt{C-^C41-C-^C^ + ^C-^CjC-^C^SOj)} + 2^^fJC-^^l,^S9J =0 . (A3) 

i j 
After solving the linear equations, we obtain the parameter deviation 69 

50, = ^(F-i),,Tr{lc-iC,,C-i(CB - C)} , (A4) 

i 

where 

F,, = m;^C-Vj + iTr{C-iC,,C"iC,,} (A5) 

is the Fisher matrix of the Gaussian likelihood function. Note that the bias in the recovered parameters is proportional 
to the difference between models A and B. 

Note that in analyzing the data generated by model B using model A changes not only the recovered parameters 
but also their error bars. By performing a similar calculation, the modified Fisher matrix with systematics now reads 

F,, = + iTr[C-iC,,C-iCjC-icB] . (A6) 

The error bar for all parameters estimated in model A using the data generated by model B can be recovered by 
inverting F. However, in the case of counts- in-cells , the likelihood fun ction is not perfectly Gaussian; it is convolution 
of Poisson and Gaussian (see e.g. iLima &: IIuir2004l : iHu fc Cohrill2006[ ). The modified Fisher matrix thus reads 

F,j = a/^C-Vj + iTr[C-iS,,C-iS.,C-icB] . (A7) 
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As a second example, we consider the case in which model B changes the mean but not the va riance of the data. 
One example is the effect of modified gravity on the weak lensing shear cross power spectrum (e.g. iHuterer fc LindeJ 
l2007f ). Here model A is the General Relativity prediction, while model B is the modified gravitational prediction. In 
this case, jl{6) ^ fl^{d) while C{9) — C^{9). The linear equation for 69 j reads 

TT{Y^c-^c,^c-^c.J^^J}-2(p^-flfc-^^l.^ + 2'^^l^^c-^^I,^S9j^o , (A8) 

j j 

which is equivalent to 

69, = ^(F-1),,{(mB - m)^C-V,,} . (A9) 

i 

Our formalism thus provides a different and generalizable route of obtaining the systematic error. 
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